Abstract

This article contains a proof of the MDS conjecture for $k \leq 2p - 2$. That is,
if $S$ is a set of vectors of $\mathbb{F}_q^k$ in which every subset of $S$ of size $k$ is a basis, where
$q = p^h$, $p$ is prime, $q$ is not prime and $k \leq 2p - 2$, then $|S| \leq q + 1$. It also contains a
short proof of the same fact for $k \leq p$, for all $q$.

1 Introduction 

Let $S$ be a set of vectors of $\mathbb{F}_q^k$ in which every subset of size $k$ is a basis.

In 1952, Bush [2] showed that if $k \geq q$ then $|S| \leq k + 1$ and the bound is attained if
and only if $S$ is equivalent to $\{e_1, \ldots, e_k, e_1 + \cdots + e_k\}$, where $\{e_1, \ldots, e_k\}$ is a basis.

The main conjecture for maximum distance separable codes (the MDS conjecture), 
proposed (as a question) by Segre [9] in 1955 is the following. 

Conjecture 1.1. A set $S$ of vectors of the vector space $\mathbb{F}_q^k$ with the property that every
subset of $S$ of size $k \leq q$ is a basis, has size at most $q + 1$, unless $q$ is even and $k = 3$ or
$k = q - 1$, in which case it has size at most $q + 2$.
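As a small illustration of the hypothesis in Conjecture 1.1, the following sketch checks by brute force that the normal rational curve gives $q + 1$ vectors of $\mathbb{F}_q^k$ in which every $k$-subset is a basis, and that for $q = 5$, $k = 3$ this set cannot be extended. The function names and the restriction to a prime field (so that arithmetic is plain modular arithmetic) are assumptions made for this example only; they are not taken from the article.

```python
from itertools import combinations, product


def det_mod(rows, p):
    """Determinant of a square matrix over F_p (p prime), by Gaussian elimination."""
    m = [[x % p for x in row] for row in rows]
    n = len(m)
    det = 1
    for c in range(n):
        # Find a row at or below the diagonal with a non-zero entry in column c.
        pivot = next((r for r in range(c, n) if m[r][c]), None)
        if pivot is None:
            return 0
        if pivot != c:
            m[c], m[pivot] = m[pivot], m[c]
            det = -det  # a row swap flips the sign
        det = det * m[c][c] % p
        inv = pow(m[c][c], -1, p)  # modular inverse (Python 3.8+)
        for r in range(c + 1, n):
            f = m[r][c] * inv % p
            m[r] = [(a - f * b) % p for a, b in zip(m[r], m[c])]
    return det % p


def every_k_subset_is_basis(vectors, k, p):
    """True if every k-subset of `vectors` is a basis of F_p^k."""
    return all(det_mod(sub, p) != 0 for sub in combinations(vectors, k))


def normal_rational_curve(k, p):
    """The p vectors (1, s, ..., s^(k-1)), s in F_p, plus (0, ..., 0, 1)."""
    curve = [tuple(pow(s, i, p) for i in range(k)) for s in range(p)]
    return curve + [(0,) * (k - 1) + (1,)]


p, k = 5, 3  # illustrative parameters, chosen small enough for brute force
S = normal_rational_curve(k, p)
print(len(S), every_k_subset_is_basis(S, k, p))

# Try to extend S by any vector of F_5^3 while keeping the basis property.
extensions = [v for v in product(range(p), repeat=k)
              if every_k_subset_is_basis(S + [v], k, p)]
print(len(extensions))
```

The search over extensions is exhaustive only because the space is tiny; it is meant as a sanity check of the statement in one case, not as evidence for the conjecture in general.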

In this article we shall prove the conjecture for all $k \leq 2p - 2$, where $q = p^h$, $p$ is
prime and $q$ is not prime.

We shall also prove the conjecture for $q$ prime, which was first proven in [1]. It may
help the reader to look at the first four sections of [1], although this article is self-contained
(with the exception of the proof of Lemma [27Tb and can be read independently. The proof 
here is based on the ideas of [1], which themselves are based on the initial idea of Segre
in®. 

For a complete list of when the conjecture is known to hold for q non-prime, see flU 
and also [0. 

The best known bounds, up to first-order of magnitude ($c_i$ are constants), are that
for $q$ an odd non-square, the conjecture holds for $k < \sqrt{pq}/4 + c_1 p$, Voloch [11]. For
$q = p^{2h}$, where $p \geq 5$ is a prime, the conjecture holds for $k < \sqrt{q}/2 + c_2$, Hirschfeld
and Korchmaros [3], and here we shall prove the conjecture for $k < 2\sqrt{q} + c_3$ in the case
$q = p^2$. The conjecture is known to hold for all $q \leq 27$ and for all $k \leq 5$ and $k = 6$ with
some exceptions.

Conjecture 1.1 has implications for various problems in combinatorics, most notably
for maximum distance separable codes (whence the name) from coding theory and the 
uniform matroid from matroid theory. 

A linear maximum distance separable code is a linear code of length $n$, dimension $k$
and minimum distance $d$ over $\mathbb{F}_q$, for which $d = n - k + 1$. Conjecture 1.1 implies that a
linear maximum distance separable code has length $n$ at most $q + 1$ unless $q$ is even and
$k = 3$ or $k = q - 1$, in which case it has length at most $q + 2$. For more details on codes
and MDS codes in particular, see Q. 
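The link between such sets of vectors and MDS codes can be checked directly in a small case. The sketch below takes the $q + 1 = 6$ vectors of the normal rational curve in $\mathbb{F}_5^3$ as the columns of a generator matrix and computes the minimum distance by enumerating all codewords. The parameters and the helper `min_distance` are assumptions for this example.

```python
from itertools import product


def min_distance(G, p):
    """Minimum Hamming weight of a non-zero codeword of the code over F_p
    generated by the rows of G (equal to the minimum distance of a linear code)."""
    k, n = len(G), len(G[0])
    best = n
    for msg in product(range(p), repeat=k):
        if not any(msg):
            continue  # skip the zero codeword
        word = [sum(m * row[j] for m, row in zip(msg, G)) % p for j in range(n)]
        best = min(best, sum(1 for c in word if c))
    return best


p, k = 5, 3  # illustrative parameters
# Columns: (1, s, s^2) for s in F_5, together with (0, 0, 1).
columns = [tuple(pow(s, i, p) for i in range(k)) for s in range(p)] + [(0, 0, 1)]
G = [[col[i] for col in columns] for i in range(k)]
n = len(columns)
d = min_distance(G, p)
print(n, k, d)
```

Because every three columns are linearly independent, no non-zero codeword can vanish in three positions, which is why the computed distance meets the bound $d = n - k + 1$.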

A matroid $M = (E, F)$ is a pair in which $E$ is a set and $F$ is a set of subsets of $E$,
called independent sets, such that (1) every subset of an independent set is an independent
set; and (2) for all $A \subseteq E$, all maximal independent subsets of $A$ have the same
cardinality, called the rank of $A$ and denoted $r(A)$. The maximal independent sets of the
uniform matroid of rank $r$ are all the $r$-element subsets of the set $E$. Conjecture 1.1 implies
that the uniform matroid of rank $r$, with $|E| \geq r + 2$, is representable over $\mathbb{F}_q$ if and only
if $|E| \leq q + 1$, unless $q$ is even and $r = 3$ or $r = q - 1$, in which case it is representable if and only if
$|E| \leq q + 2$. For more details on matroids and representations of matroids in particular,
see Id. 
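The statement about uniform matroids can be tried out on the smallest interesting case, rank $r = 2$ on $4$ elements. The sketch below searches exhaustively for a representation over a prime field; the function `find_representation` and the choice of parameters are assumptions for this example, and the search is only feasible for very small sizes.

```python
from itertools import combinations, product


def det_mod(rows, p):
    """Determinant over F_p by Laplace expansion along the first row (small sizes only)."""
    if len(rows) == 1:
        return rows[0][0] % p
    total = 0
    for j, entry in enumerate(rows[0]):
        minor = [row[:j] + row[j + 1:] for row in rows[1:]]
        total += (-1) ** j * entry * det_mod(minor, p)
    return total % p


def find_representation(r, n, p):
    """Search for n vectors of F_p^r whose r-subsets are all bases, that is, a
    representation of the uniform matroid of rank r on n elements.
    Returns None if the exhaustive search finds nothing."""
    nonzero = [v for v in product(range(p), repeat=r) if any(v)]
    for candidate in combinations(nonzero, n):
        if all(det_mod(list(sub), p) != 0 for sub in combinations(candidate, r)):
            return candidate
    return None


print(find_representation(2, 4, 2))  # 4 > q + 1 = 3
print(find_representation(2, 4, 3))  # 4 <= q + 1 = 4
```

For rank 2 the bound $|E| \leq q + 1$ is just the number of points of the projective line over $\mathbb{F}_q$, which is what the search reflects.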

2 The tangent function and the Segre product 

For any subset $Y$ of $k - 2$ elements of $S$, since there are at most $k - 1$ vectors of $S$ in a
hyperplane, there are exactly
\[ t = q + 1 - (|S| - k + 2) = q + k - 1 - |S| \]
hyperplanes containing $Y$ and no other vector of $S$.
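The count of these hyperplanes can be confirmed by brute force in a small case. The sketch below lists, for each $Y$, one linear form per hyperplane through $Y$ that misses the rest of $S$, and compares the number found with $t = q + k - 1 - |S|$. The set $S$ (six points of a conic over $\mathbb{F}_7$) and the helper names are assumptions for this example.

```python
from itertools import combinations, product


def evaluate(alpha, v, p):
    """Value of the linear form alpha at the vector v, in F_p."""
    return sum(a * x for a, x in zip(alpha, v)) % p


def tangent_forms(Y, S, p, k):
    """One linear form per hyperplane of F_p^k that contains Y and no other vector of S.
    Forms are normalised so that their first non-zero coefficient is 1."""
    forms = []
    for alpha in product(range(p), repeat=k):
        lead = next((c for c in alpha if c), None)
        if lead != 1:
            continue  # skip the zero form and non-normalised multiples
        vanishes_on_Y = all(evaluate(alpha, y, p) == 0 for y in Y)
        misses_rest = all(evaluate(alpha, s, p) != 0 for s in S if s not in Y)
        if vanishes_on_Y and misses_rest:
            forms.append(alpha)
    return forms


p, k = 7, 3  # illustrative parameters
S = [(1, s, s * s % p) for s in range(6)]  # six points of a conic; any 3 form a basis
t = p + k - 1 - len(S)                     # value predicted by the formula
counts = {len(tangent_forms(Y, S, p, k)) for Y in combinations(S, k - 2)}
print(t, counts)
```

The same function returns the forms themselves, so it can also be used to evaluate products of tangent forms at a vector.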

We shall assume throughout that $t \geq 1$, which is no restriction since we are trying to
prove $|S| \leq q + 1$ for $k \geq 4$.

Let $\phi_Y$ be a set of $t$ pairwise linearly independent linear maps from $\mathbb{F}_q^k$ to $\mathbb{F}_q$ with the property
that for each $\alpha \in \phi_Y$, $\mathrm{Ker}(\alpha)$ is one of the $t$ hyperplanes containing $Y$ and no other
vector of $S$.

The tangent function at $Y$ is defined (up to scalar factor) as
\[ T_Y(x) = \prod_{\alpha \in \phi_Y} \alpha(x) \]
and is a map from $\mathbb{F}_q^k$ to $\mathbb{F}_q$.

The following is a coordinate-free version of Segre's lemma of tangents [10] and is
from [1].

Lemma 2.1. Let $D$ be a set of $k - 3$ elements of $S$. For all $x, y, z \in S \setminus D$,
\[ T_{\{x\} \cup D}(y)\, T_{\{y\} \cup D}(z)\, T_{\{z\} \cup D}(x) = (-1)^{t+1}\, T_{\{x\} \cup D}(z)\, T_{\{y\} \cup D}(x)\, T_{\{z\} \cup D}(y). \]
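Lemma 2.1 can be tested numerically for $k = 3$, where $D$ is empty. The sketch below assumes the tangent function at $Y$ is the product of the linear forms whose kernels are the hyperplanes through $Y$ missing the rest of $S$; since each $T_Y$ occurs once on each side of the identity, the choice of scalar factor does not matter. The sets $S$ and all function names are assumptions for this example.

```python
from itertools import permutations, product


def evaluate(alpha, v, p):
    """Value of the linear form alpha at the vector v, in F_p."""
    return sum(a * x for a, x in zip(alpha, v)) % p


def tangent_forms(Y, S, p, k):
    """One normalised linear form per hyperplane containing Y and no other vector of S."""
    forms = []
    for alpha in product(range(p), repeat=k):
        if next((c for c in alpha if c), None) != 1:
            continue
        if all(evaluate(alpha, y, p) == 0 for y in Y) and \
           all(evaluate(alpha, s, p) != 0 for s in S if s not in Y):
            forms.append(alpha)
    return forms


def tangent_value(Y, x, S, p, k):
    """T_Y(x): the product of alpha(x) over the tangent forms alpha at Y."""
    value = 1
    for alpha in tangent_forms(Y, S, p, k):
        value = value * evaluate(alpha, x, p) % p
    return value


def lemma_of_tangents_holds(S, p):
    """Check the identity of Lemma 2.1 for k = 3 (D empty) on all ordered triples of S."""
    k = 3
    t = p + k - 1 - len(S)
    sign = (-1) ** (t + 1)
    T = {(y, x): tangent_value([y], x, S, p, k) for y in S for x in S if x != y}
    return all(
        (T[x, y] * T[y, z] * T[z, x] - sign * T[x, z] * T[y, x] * T[z, y]) % p == 0
        for x, y, z in permutations(S, 3)
    )


conic6 = [(1, s, s * s % 7) for s in range(6)]  # t = 3, sign +1
conic7 = [(1, s, s * s % 7) for s in range(7)]  # t = 2, sign -1
print(lemma_of_tangents_holds(conic6, 7), lemma_of_tangents_holds(conic7, 7))
```

The two test sets differ in the parity of $t$, so both values of the sign $(-1)^{t+1}$ are exercised.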

Since we wish to write $\det(A)$, where $A = \{a_1, \ldots, a_k\}$ is a subset of $S$, to mean
the determinant $\det(a_1, \ldots, a_k)$, we order the elements of $S$ from now on. We write
$\det(A_1, \ldots, A_r)$ to mean the determinant in which the elements of $A_1$ come first, then
the elements of $A_2$, etc.

The following, which follows from interpolating the tangent function, is also from [1].

Lemma 2.2. If $|S| \geq k + t > k$ then for any $Y$ of size $k - 2$ and $E$ of size $t + 2$, disjoint
subsets of $S$,
\[ 0 = \sum_{a \in E} T_Y(a) \prod_{z \in E \setminus \{a\}} \det(z, a, Y)^{-1}. \]

Let $A = (a_1, \ldots, a_n)$ and $B = (b_0, \ldots, b_{n-1})$ be two subsequences of $S$ of the same
length $n$ and let $D$ be a subset of $S \setminus (A \cup B)$ of size $k - n - 1$.
We define the Segre product of $A$ and $B$ with base $D$ to be
\[ P_D(A, B) = \prod_{i=1}^{n} \frac{T_{D \cup \{a_1, \ldots, a_{i-1}, b_i, \ldots, b_{n-1}\}}(a_i)}{T_{D \cup \{a_1, \ldots, a_{i-1}, b_i, \ldots, b_{n-1}\}}(b_{i-1})} \]
and $P_D(\emptyset, \emptyset) = 1$.

The following lemmas are a consequence of Lemma 2.1.

Lemma 2.3.
\[ P_D(A^*, B) = (-1)^{t+1} P_D(A, B), \]
where the sequence $A^*$ is obtained from $A$ by interchanging two elements.
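The Segre product and the sign change in Lemma 2.3 can be computed directly in a small case. The sketch below assumes $P_D(A, B)$ is the product over $i = 1, \ldots, n$ of $T_Y(a_i) / T_Y(b_{i-1})$ with $Y = D \cup \{a_1, \ldots, a_{i-1}, b_i, \ldots, b_{n-1}\}$, and that $T_Y$ is the product of the tangent forms at $Y$. The set $S$ (seven points of a conic over $\mathbb{F}_7$, so $t = 2$) and all function names are assumptions for this example.

```python
from itertools import product


def evaluate(alpha, v, p):
    """Value of the linear form alpha at the vector v, in F_p."""
    return sum(a * x for a, x in zip(alpha, v)) % p


def tangent_forms(Y, S, p, k):
    """One normalised linear form per hyperplane containing Y and no other vector of S."""
    forms = []
    for alpha in product(range(p), repeat=k):
        if next((c for c in alpha if c), None) != 1:
            continue
        if all(evaluate(alpha, y, p) == 0 for y in Y) and \
           all(evaluate(alpha, s, p) != 0 for s in S if s not in Y):
            forms.append(alpha)
    return forms


def tangent_value(Y, x, S, p, k):
    """T_Y(x): the product of alpha(x) over the tangent forms alpha at Y."""
    value = 1
    for alpha in tangent_forms(Y, S, p, k):
        value = value * evaluate(alpha, x, p) % p
    return value


def segre_product(D, A, B, S, p, k):
    """P_D(A, B) for A = (a_1..a_n), B = (b_0..b_{n-1}) and |D| = k - n - 1."""
    value = 1
    for i in range(1, len(A) + 1):
        Y = list(D) + list(A[:i - 1]) + list(B[i:])        # D, a_1..a_{i-1}, b_i..b_{n-1}
        numerator = tangent_value(Y, A[i - 1], S, p, k)    # T_Y(a_i)
        denominator = tangent_value(Y, B[i - 1], S, p, k)  # T_Y(b_{i-1}), non-zero
        value = value * numerator * pow(denominator, -1, p) % p
    return value


def swapped(seq, i, j):
    """Copy of seq with the entries at positions i and j interchanged."""
    out = list(seq)
    out[i], out[j] = out[j], out[i]
    return out


p, k = 7, 3  # illustrative parameters
S = [(1, s, s * s % p) for s in range(7)]
t = p + k - 1 - len(S)
sign = (-1) ** (t + 1)
A, B, D = S[0:2], S[2:4], []
base = segre_product(D, A, B, S, p, k)
after_swap = segre_product(D, swapped(A, 0, 1), B, S, p, k)
print(base, after_swap, sign * base % p)
```

Each tangent function appears once in a numerator and once in a denominator, so the value of the product does not depend on how the forms are scaled.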

Proof. It is enough to prove the lemma for two adjacent elements in $A$ since the
transposition $(j\ \ell)$ can be written as the product of $2(\ell - j) + 1$ transpositions of the form
$(n\ n+1)$.

The only terms in the Segre product which differ when we interchange $a_j$ and $a_{j+1}$ are
the terms in the product for $i = j$ and $i = j + 1$. Trivially
\[ \frac{T_{D \cup \{a_1, \ldots, a_{j-1}, b_j, b_{j+1}, \ldots, b_{n-1}\}}(a_j)}{T_{D \cup \{a_1, \ldots, a_{j-1}, b_j, b_{j+1}, \ldots, b_{n-1}\}}(b_{j-1})} \cdot \frac{T_{D \cup \{a_1, \ldots, a_{j-1}, a_j, b_{j+1}, \ldots, b_{n-1}\}}(a_{j+1})}{T_{D \cup \{a_1, \ldots, a_{j-1}, a_j, b_{j+1}, \ldots, b_{n-1}\}}(b_j)} \]
is equal to
\[ \frac{T_{\Delta \cup \{b_j\}}(a_j)}{T_{\Delta \cup \{b_j\}}(b_{j-1})} \cdot \frac{T_{\Delta \cup \{a_j\}}(a_{j+1})}{T_{\Delta \cup \{a_j\}}(b_j)}, \]
where $\Delta = D \cup \{a_1, \ldots, a_{j-1}, b_{j+1}, \ldots, b_{n-1}\}$, which is equal to
\[ (-1)^{t+1}\, \frac{T_{\Delta \cup \{b_j\}}(a_{j+1})}{T_{\Delta \cup \{b_j\}}(b_{j-1})} \cdot \frac{T_{\Delta \cup \{a_{j+1}\}}(a_j)}{T_{\Delta \cup \{a_{j+1}\}}(b_j)} \]
by Lemma 2.1. □

In the same way the following lemma also holds. 
Lemma 2.4.
\[ P_D(A, B^*) = (-1)^{t+1} P_D(A, B), \]
where the sequence $B^*$ is obtained from $B$ by interchanging two elements.

The following lemma will also be needed. 

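Lemma 2.5. If $A$ and $B$ are subsequences of $S$ and $|A| = |B| - 1$ then
\[ \frac{T_{D \cup B}(y)}{T_{D \cup B}(x)}\, P_{D \cup \{y\}}(\{x\} \cup A, B) = (-1)^{t+1} P_{D \cup \{x\}}(\{y\} \cup A, B). \]

Proof. Write $B = (b_0, \ldots, b_{n-1})$. Using the definition of the Segre product and Lemma 2.1,
\[ \frac{T_{D \cup B}(y)}{T_{D \cup B}(x)}\, P_{D \cup \{y\}}(\{x\} \cup A, B) = \frac{T_{D \cup B}(y)\, T_{D \cup \{b_1, \ldots, b_{n-1}, y\}}(x)}{T_{D \cup B}(x)\, T_{D \cup \{b_1, \ldots, b_{n-1}, y\}}(b_0)}\, P_{D \cup \{x, y\}}(A, B \setminus \{b_0\}) = (-1)^{t+1}\, \frac{T_{D \cup \{b_1, \ldots, b_{n-1}, x\}}(y)}{T_{D \cup \{b_1, \ldots, b_{n-1}, x\}}(b_0)}\, P_{D \cup \{x, y\}}(A, B \setminus \{b_0\}), \]
and the right-hand side is $(-1)^{t+1} P_{D \cup \{x\}}(\{y\} \cup A, B)$ by the definition of the Segre product. □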



3 The main lemma 

For any subset $B$ of an ordered set $L$, let $\sigma(B, L)$ be $(t + 1)$ times the number of
transpositions needed to order $L$ so that the elements of $B$ are the last $|B|$ elements.

Lemma 3.1. Let $A$ of size $n$, $L$ of size $r$, $D$ of size $k - 1 - r$ and $\Omega$ of size $t + 1 - n$ be
pairwise disjoint subsequences of $S$. If $n \leq r \leq n + p - 1$ and $r \leq t + 2$, where $q = p^h$,
then
\[
\sum_{\substack{B \subseteq L \\ |B| = n}} (-1)^{\sigma(B, L)} P_{D \cup (L \setminus B)}(A, B) \prod_{z \in \Omega \cup B} \det(z, A, L \setminus B, D)^{-1}
= (-1)^{(r-n)(nt+n+1)} \sum_{\substack{\Delta \subseteq \Omega \\ |\Delta| = r-n}} P_D(A \cup \Delta, L) \prod_{z \in (\Omega \setminus \Delta) \cup L} \det(z, A, \Delta, D)^{-1}.
\]

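Proof. By induction on $r$. The case $r = n$ is straightforward.

Fix an $x \in L$ and apply the inductive step to $L \setminus \{x\}$ and $\{x\} \cup D$,
\[
\sum_{\substack{B \subseteq L \setminus \{x\} \\ |B| = n}} (-1)^{\sigma(B, L \setminus \{x\})} P_{D \cup (L \setminus B)}(A, B) \prod_{z \in \Omega \cup B} \det(z, A, L \setminus (B \cup \{x\}), x, D)^{-1}
= (-1)^{(r-n-1)(nt+n+1)} \sum_{\substack{\Delta \subseteq \Omega \\ |\Delta| = r-n-1}} P_{D \cup \{x\}}(A \cup \Delta, L \setminus \{x\}) \prod_{z \in (\Omega \setminus \Delta) \cup (L \setminus \{x\})} \det(z, A, \Delta, x, D)^{-1}.
\]

Let $\Delta$ be a subset of $\Omega$ of size $r - n - 1$. The set $\Omega \setminus \Delta$ has size $t + 1 - n - (r - n - 1) = t + 2 - r$
and so since $r \leq t + 2$ we can apply Lemma 2.2 with $E = L \cup (\Omega \setminus \Delta)$ and $Y = D \cup A \cup \Delta$,
and get
\[
0 = \sum_{x \in L} T_{D \cup A \cup \Delta}(x) \prod_{z \in (\Omega \setminus \Delta) \cup (L \setminus \{x\})} \det(z, A, \Delta, x, D)^{-1}
+ \sum_{y \in \Omega \setminus \Delta} T_{D \cup A \cup \Delta}(y) \prod_{z \in (\Omega \setminus (\Delta \cup \{y\})) \cup L} \det(z, A, \Delta, y, D)^{-1}.
\]

Multiply this equation by $P_D(A \cup \Delta \cup \{d\}, L)\, T_{D \cup A \cup \Delta}(d)^{-1}$ for some $d$ for which $T_{D \cup A \cup \Delta}(d) \neq 0$.
By Lemma 2.4 we can rearrange $L$ so that the last element is $x$, which changes the sign
by $\sigma(x, L)$. This gives
\[
0 = \sum_{x \in L} (-1)^{\sigma(x, L)} P_{D \cup \{x\}}(A \cup \Delta, L \setminus \{x\}) \prod_{z \in (\Omega \setminus \Delta) \cup (L \setminus \{x\})} \det(z, A, \Delta, x, D)^{-1}
+ \sum_{y \in \Omega \setminus \Delta} P_D(A \cup \Delta \cup \{y\}, L) \prod_{z \in (\Omega \setminus (\Delta \cup \{y\})) \cup L} \det(z, A, \Delta, y, D)^{-1},
\]
since
\[ P_D(A \cup \Delta \cup \{d\}, L)\, T_{D \cup A \cup \Delta}(x)\, T_{D \cup A \cup \Delta}(d)^{-1} = P_{D \cup \{x\}}(A \cup \Delta, L \setminus \{x\}) \]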
and by Lemma [231 (and Lemma [231) 

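\[ P_D(A \cup \Delta \cup \{d\}, L)\, T_{D \cup A \cup \Delta}(y)\, T_{D \cup A \cup \Delta}(d)^{-1} = P_D(A \cup \Delta \cup \{y\}, L). \]

Note that in the second term we can order $\Delta \cup \{y\}$ in any way we please without changing
the sign since, by Lemma 2.3, interchanging two elements of $\Delta \cup \{y\}$ in
$P_D(A \cup \Delta \cup \{y\}, L)$ changes the sign by $(-1)^{t+1}$; exactly the same change occurs when
we interchange the same vectors in the product of determinants.

Therefore, when we sum this equation over subsets $\Delta$ of $\Omega$ of size $r - n - 1$ and apply
the induction hypothesis, we get
\[
0 = \sum_{x \in L} (-1)^{\sigma(x, L) + (r-n-1)(nt+n+1)} \sum_{\substack{B \subseteq L \setminus \{x\} \\ |B| = n}} (-1)^{\sigma(B, L \setminus \{x\})} P_{D \cup (L \setminus B)}(A, B) \prod_{z \in \Omega \cup B} \det(z, A, L \setminus (B \cup \{x\}), x, D)^{-1}
+ (r - n) \sum_{\substack{\Delta \subseteq \Omega \\ |\Delta| = r-n}} P_D(A \cup \Delta, L) \prod_{z \in (\Omega \setminus \Delta) \cup L} \det(z, A, \Delta, D)^{-1}.
\]

Since
\[ \sigma(B, L) = \sigma(x, L) + \sigma(B, L \setminus \{x\}) + \sigma(x, L \setminus B) + n(t + 1), \]
this equation gives
\[
(-1)^{(r-n)(nt+n+1)} (r - n) \sum_{\substack{B \subseteq L \\ |B| = n}} (-1)^{\sigma(B, L)} P_{D \cup (L \setminus B)}(A, B) \prod_{z \in \Omega \cup B} \det(z, A, L \setminus B, D)^{-1}
= (r - n) \sum_{\substack{\Delta \subseteq \Omega \\ |\Delta| = r-n}} P_D(A \cup \Delta, L) \prod_{z \in (\Omega \setminus \Delta) \cup L} \det(z, A, \Delta, D)^{-1}.
\]
Since $0 < r - n \leq p - 1$, the factor $r - n$ is non-zero in $\mathbb{F}_q$ and can be cancelled,
which is what we wanted to prove. □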
Theorem 3.2. If $k \leq p$ then $|S| \leq q + 1$.

Proof. If $|S| = q + 2$ then $t = k - 3$. If $q$ is prime then, by [1, Lemma 5.1], we may
dualise in $\mathbb{F}_q^{q+2}$, if necessary, to assume that $k \leq (q + 1)/2$ and so $k + t \leq q + 2$.

Since $k + t \leq q + 2$ we can apply Lemma 3.1 with $r = t + 2 = k - 1$ and $n = 0$ and
get
\[ \prod_{z \in \Omega} \det(z, L)^{-1} = 0, \]
which is a contradiction. □
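4 The case |S| = q + 2 and q is non-prime.

For any subsequence $X = \{x_1, \ldots, x_m\}$ of $S$ and $\tau \subseteq \{1, 2, \ldots, m\}$, define the
subsequence $X_\tau = \{x_i \mid i \in \tau\}$.

Lemma 4.1. Suppose that $|S| = q + 2$ and $n \geq k - p$. Let $A$ of size $n - m$, $L$ of size
$k - 1 - m$, $\Omega$ of size $k - 2 - n$, $X$ of size $m$, $Y$ of size $m$ be disjoint subsequences of $S$.
Then
\[
0 = \sum_{\substack{B \subseteq L \\ |B| = n-m}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, L) + \sigma(X_\tau, X) + |\tau|} P_{(L \setminus B) \cup X_{M \setminus \tau}}(A \cup Y_\tau, B \cup X_\tau) \prod_{z \in \Omega \cup B \cup X_\tau \cup Y_{M \setminus \tau}} \det(z, A, X_{M \setminus \tau}, Y_\tau, L \setminus B)^{-1},
\]
where $M = \{1, \ldots, m\}$.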

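Proof. By induction on $m$. For $m = 0$ this is Lemma 3.1 with $r = t + 2 = k - 1$, which
gives the bound $n \geq k - p$.

Suppose that $X$ and $Y$ have size $m$ and that $x, y \in S$ are not contained in $X$, $Y$, $L$ or
$A$. We wish to prove the equation for $X \cup \{x\}$, $Y \cup \{y\}$, $L$ and $A$, where $|L| = k - 2 - m$
and $|A| = n - m - 1$.

Apply the inductive step to $\{y\} \cup L$, $A \cup \{x\}$, $X$ and $Y$.

Writing the first sum as two sums depending on whether $B$ contains $y$ or not, we have
\[
0 = \sum_{\substack{B \subseteq L \\ |B| = n-m}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, \{y\} \cup L) + \sigma(X_\tau, X) + |\tau|} P_{(L \setminus B) \cup \{y\} \cup X_{M \setminus \tau}}(A \cup \{x\} \cup Y_\tau, B \cup X_\tau) \prod_{z \in \Omega \cup B \cup X_\tau \cup Y_{M \setminus \tau}} \det(z, A, x, X_{M \setminus \tau}, Y_\tau, y, L \setminus B)^{-1}
\]
\[
+ \sum_{\substack{B \subseteq L \\ |B| = n-m-1}} \sum_{\tau \subseteq M} (-1)^{\sigma(\{y\} \cup B, \{y\} \cup L) + \sigma(X_\tau, X) + |\tau|} P_{(L \setminus B) \cup X_{M \setminus \tau}}(A \cup \{x\} \cup Y_\tau, \{y\} \cup B \cup X_\tau) \prod_{z \in \Omega \cup B \cup X_\tau \cup Y_{M \setminus \tau} \cup \{y\}} \det(z, A, x, X_{M \setminus \tau}, Y_\tau, L \setminus B)^{-1}.
\]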

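By Lemma 2.3, then Lemma 2.5 and then Lemma 2.3 again, we have
\[
\frac{T_{L \cup X}(y)}{T_{L \cup X}(x)}\, P_{(L \setminus B) \cup \{y\} \cup X_{M \setminus \tau}}(A \cup \{x\} \cup Y_\tau, B \cup X_\tau)
= (-1)^{(n-m+1)(t+1)}\, \frac{T_{L \cup X}(y)}{T_{L \cup X}(x)}\, P_{(L \setminus B) \cup \{y\} \cup X_{M \setminus \tau}}(\{x\} \cup A \cup Y_\tau, B \cup X_\tau)
\]
\[
= (-1)^{(n-m)(t+1)} P_{(L \setminus B) \cup \{x\} \cup X_{M \setminus \tau}}(\{y\} \cup A \cup Y_\tau, B \cup X_\tau)
= (-1)^{t+1} P_{(L \setminus B) \cup \{x\} \cup X_{M \setminus \tau}}(A \cup \{y\} \cup Y_\tau, B \cup X_\tau),
\]
and by Lemma 2.3 and the definition of the Segre product
\[
\frac{T_{L \cup X}(y)}{T_{L \cup X}(x)}\, P_{(L \setminus B) \cup X_{M \setminus \tau}}(A \cup \{x\} \cup Y_\tau, \{y\} \cup B \cup X_\tau)
= (-1)^{(n-m+1)(t+1)}\, \frac{T_{L \cup X}(y)}{T_{L \cup X}(x)}\, P_{(L \setminus B) \cup X_{M \setminus \tau}}(\{x\} \cup A \cup Y_\tau, \{y\} \cup B \cup X_\tau)
\]
\[
= (-1)^{(n-m+1)(t+1)} P_{(L \setminus B) \cup X_{M \setminus \tau} \cup \{x\}}(A \cup Y_\tau, B \cup X_\tau).
\]
Thus, multiplying the equation before by $T_{L \cup X}(y)\, T_{L \cup X}(x)^{-1}$ and noting that
\[ \sigma(\{y\} \cup B, \{y\} \cup L) = \sigma(B, L) + (k - n - 1)(t + 1), \]
we have
\[
0 = \sum_{\substack{B \subseteq L \\ |B| = n-m}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, L) + \sigma(X_\tau, X) + |\tau|} P_{(L \setminus B) \cup \{x\} \cup X_{M \setminus \tau}}(A \cup \{y\} \cup Y_\tau, B \cup X_\tau) \prod_{z \in \Omega \cup B \cup X_\tau \cup Y_{M \setminus \tau}} \det(z, A, x, X_{M \setminus \tau}, Y_\tau, y, L \setminus B)^{-1}
\]
\[
+ \sum_{\substack{B \subseteq L \\ |B| = n-m-1}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, L) + \sigma(X_\tau, X) + |\tau| + (k-m-1)(t+1)} P_{(L \setminus B) \cup X_{M \setminus \tau} \cup \{x\}}(A \cup Y_\tau, B \cup X_\tau) \prod_{z \in \Omega \cup B \cup X_\tau \cup Y_{M \setminus \tau} \cup \{y\}} \det(z, A, x, X_{M \setminus \tau}, Y_\tau, L \setminus B)^{-1}.
\]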

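Applying the inductive step to $\{x\} \cup L$, $A \cup \{y\}$, $X$ and $Y$ and writing the sum as two
sums depending on whether $B$ contains $x$ or not, gives an equation similar to the above.
The first sums in the two equations differ only in the position of $x$ and $y$ in the determinants.
Switching these in the above, multiplying by $(-1)^{t+1}$, and equating the two second sums
gives
\[
\sum_{\substack{B \subseteq L \\ |B| = n-m-1}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, L) + \sigma(X_\tau, X) + |\tau| + (k-m)(t+1)} P_{(L \setminus B) \cup X_{M \setminus \tau} \cup \{x\}}(A \cup Y_\tau, B \cup X_\tau) \prod_{z \in \Omega \cup B \cup X_\tau \cup Y_{M \setminus \tau} \cup \{y\}} \det(z, A, x, X_{M \setminus \tau}, Y_\tau, L \setminus B)^{-1}
\]
\[
= \sum_{\substack{B \subseteq L \\ |B| = n-m-1}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, L) + \sigma(X_\tau, X) + |\tau| + (k-n-1)(t+1)} P_{(L \setminus B) \cup X_{M \setminus \tau}}(A \cup \{y\} \cup Y_\tau, \{x\} \cup B \cup X_\tau) \prod_{z \in \Omega \cup B \cup X_\tau \cup Y_{M \setminus \tau} \cup \{x\}} \det(z, A, y, X_{M \setminus \tau}, Y_\tau, L \setminus B)^{-1}.
\]
Note that on the right-hand side of the equality we use
\[ \sigma(\{x\} \cup B, \{x\} \cup L) = \sigma(B, L) + (k - n - 1)(t + 1). \]
Rearranging the order of the vectors in the Segre product of the right-hand side (applying
Lemma 2.3 and Lemma 2.4) and the vectors in the determinants gives
\[
\sum_{\substack{B \subseteq L \\ |B| = n-m-1}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, L) + \sigma(X_\tau, X) + |\tau| + |\tau|(t+1)} P_{(L \setminus B) \cup X_{M \setminus \tau} \cup \{x\}}(A \cup Y_\tau, B \cup X_\tau) \prod_{z \in \Omega \cup B \cup X_\tau \cup Y_{M \setminus \tau} \cup \{y\}} \det(z, A, X_{M \setminus \tau}, x, Y_\tau, L \setminus B)^{-1}
\]
\[
= \sum_{\substack{B \subseteq L \\ |B| = n-m-1}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, L) + \sigma(X_\tau, X) + |\tau|} P_{(L \setminus B) \cup X_{M \setminus \tau}}(A \cup Y_\tau \cup \{y\}, B \cup X_\tau \cup \{x\}) \prod_{z \in \Omega \cup B \cup X_\tau \cup Y_{M \setminus \tau} \cup \{x\}} \det(z, A, X_{M \setminus \tau}, Y_\tau, y, L \setminus B)^{-1}.
\]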

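Finally, note that
\[ \sigma((X \cup \{x\})_\tau, X \cup \{x\}) = |\tau|(t + 1) + \sigma(X_\tau, X) \]
and that
\[ \sigma((X \cup \{x\})_{\tau \cup \{m+1\}}, X \cup \{x\}) = \sigma(X_\tau, X), \]
from which we deduce that
\[
\sum_{\substack{B \subseteq L \\ |B| = n-m-1}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, L) + \sigma(X^+_\tau, X^+) + |\tau|} P_{(L \setminus B) \cup X^+_{M^+ \setminus \tau}}(A \cup Y^+_\tau, B \cup X^+_\tau) \prod_{z \in \Omega \cup B \cup X^+_\tau \cup Y^+_{M^+ \setminus \tau}} \det(z, A, X^+_{M^+ \setminus \tau}, Y^+_\tau, L \setminus B)^{-1}
\]
\[
= \sum_{\substack{B \subseteq L \\ |B| = n-m-1}} \sum_{\tau \subseteq M} (-1)^{\sigma(B, L) + \sigma(X^+_{\tau^+}, X^+) + |\tau|} P_{(L \setminus B) \cup X^+_{M^+ \setminus \tau^+}}(A \cup Y^+_{\tau^+}, B \cup X^+_{\tau^+}) \prod_{z \in \Omega \cup B \cup X^+_{\tau^+} \cup Y^+_{M^+ \setminus \tau^+}} \det(z, A, X^+_{M^+ \setminus \tau^+}, Y^+_{\tau^+}, L \setminus B)^{-1},
\]
where $X^+ = X \cup \{x\}$, $Y^+ = Y \cup \{y\}$, $\tau^+ = \tau \cup \{m + 1\}$ and $M^+ = M \cup \{m + 1\}$,
which is what we wanted to prove. □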
5 The main theorem

The following follows from Laplace's formula for determinants.

Lemma 5.1. Suppose that $W \cup L$ is a basis of $\mathbb{F}_q^k$ and $|X| = n$ and $W = \{w_1, w_2, \ldots, w_{n+1}\}$.
Then
\[ \sum_{j=1}^{n+1} (-1)^{j+1} \det(y, W \setminus w_j, L)\, \det(w_j, X, L) = \det(W, L)\, \det(y, X, L). \]
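The determinant identity of Lemma 5.1 can be spot-checked numerically. The sketch below assumes the alternating sign $(-1)^{j+1}$ on the $j$-th term (with $j$ starting at 1) and tests the identity on random integer vectors; since both sides are polynomials in the entries, an identity over the integers also holds after reduction modulo $p$. The function names, the ranges of the random entries and the dimensions are assumptions for this example.

```python
import random


def det(rows):
    """Integer determinant by Laplace expansion along the first row (small sizes only)."""
    if len(rows) == 1:
        return rows[0][0]
    return sum((-1) ** j * entry * det([row[:j] + row[j + 1:] for row in rows[1:]])
               for j, entry in enumerate(rows[0]))


def identity_gap(k, n, rng):
    """Left side minus right side of the identity for random integer vectors of length k.
    W has n + 1 vectors, X has n, L has k - n - 1, and y is a single vector."""
    def vec():
        return [rng.randint(-3, 3) for _ in range(k)]

    W = [vec() for _ in range(n + 1)]
    X = [vec() for _ in range(n)]
    L = [vec() for _ in range(k - n - 1)]
    y = vec()
    # With j counted from 0, the sign (-1)^j matches (-1)^(j+1) for j counted from 1.
    lhs = sum((-1) ** j * det([y] + W[:j] + W[j + 1:] + L) * det([W[j]] + X + L)
              for j in range(n + 1))
    rhs = det(W + L) * det([y] + X + L)
    return lhs - rhs


rng = random.Random(0)  # fixed seed so the check is reproducible
gaps = [identity_gap(4, 2, rng) for _ in range(20)]
print(all(g == 0 for g in gaps))
```

A quick way to see why the identity holds is to take $y$ equal to one of the basis vectors in $W \cup L$: all but at most one term on the left vanish, and linearity in $y$ does the rest.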

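Theorem 5.2. If $q$ is non-prime and $k \leq 2p - 2$ then $|S| \leq q + 1$.

Proof. By Theorem 3.2 we can restrict ourselves to the cases $k \geq p + 1$.
Suppose $|S| = q + 2$ and apply Lemma 4.1 with $n = m = k - p$. Then
\[
0 = \sum_{\tau \subseteq \{1, \ldots, n\}} (-1)^{|\tau| + \sigma(X_\tau, X)} P_{L \cup X_{M \setminus \tau}}(Y_\tau, X_\tau) \prod_{z \in \Omega \cup X_\tau \cup Y_{M \setminus \tau}} \det(z, X_{M \setminus \tau}, Y_\tau, L)^{-1},
\]
where $|L| = p - 1$, $|\Omega| = p - 2$ and $|M| = k - p$.

Let $W = \{w_1, w_2, \ldots, w_{2n}\}$ be a subsequence of $S$ disjoint from $L \cup X \cup Y \cup E$,
where $E$ is a subset of $\Omega$ of size $p - 2 - n = 2p - k - 2$. Define $W_j = \{w_1, w_2, \ldots, w_j\}$.

We shall prove the following by induction on $r \leq n$,
\[
0 = \sum_{\tau \subseteq \{1, \ldots, n\}} (-1)^{|\tau| + \sigma(X_\tau, X)} P_{L \cup X_{M \setminus \tau}}(Y_\tau, X_\tau) \prod_{i=1}^{r} \det(y_{n+1-i}, X_{M \setminus \tau}, Y_\tau, L) \prod_{z \in E \cup X_\tau \cup Y_{M \setminus \tau} \cup W_{n+r}} \det(z, X_{M \setminus \tau}, Y_\tau, L)^{-1}.
\]

For $r = 0$ this is the above with $\Omega = E \cup W_n$. Applying the inductive step with $W_{n+r-1} = W_{r+n} \setminus \{w_j\}$,
where $j \in \{r, r + 1, \ldots, r + n\}$, we have
\[
0 = \sum_{\tau \subseteq \{1, \ldots, n\}} (-1)^{|\tau| + \sigma(X_\tau, X)} P_{L \cup X_{M \setminus \tau}}(Y_\tau, X_\tau) \prod_{i=1}^{r-1} \det(y_{n+1-i}, X_{M \setminus \tau}, Y_\tau, L)\, \det(w_j, X_{M \setminus \tau}, Y_\tau, L) \prod_{z \in E \cup X_\tau \cup Y_{M \setminus \tau} \cup W_{n+r}} \det(z, X_{M \setminus \tau}, Y_\tau, L)^{-1}.
\]

Multiplying by $(-1)^{j} \det(y_{n+1-r}, W_{n+r} \setminus (W_{r-1} \cup \{w_j\}), L)$, summing over $j \in \{r, r + 1, \ldots, r + n\}$
and applying Lemma 5.1 proves the induction.

For $r = n$ every term in the sum is zero apart from the term corresponding to $\tau = \emptyset$,
which gives
\[
0 = \prod_{i=1}^{n} \det(y_{n+1-i}, X, L) \prod_{z \in E \cup Y \cup W_{2n}} \det(z, X, L)^{-1},
\]
which is a contradiction. □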






Corollary 5.3. If $q$ is non-prime and $q - 2p + 4 \leq k \leq q$ then $|S| \leq q + 1$.

Proof. Suppose that $|S| = q + 2$. Then by [1, Lemma 5.1] we can construct a set of vectors
$S'$ of $\mathbb{F}_q^{q+2-k}$ of size $q + 2$ with the property that every subset of $S'$ of size $q + 2 - k$ is a
basis of $\mathbb{F}_q^{q+2-k}$. Since $q + 2 - k \leq 2p - 2$, this contradicts Theorem 5.2. □



6 Appendix 

Using the Segre product and the lemmas from Section 2 we can give a short proof of [1,
Lemma 4.1], the main tool used to prove that $|S| \leq q + 1$ and classify the case $|S| = q + 1$,
for $k \leq p$, in [1].

Lemma 6.1. Let $L$ of size $r$, $D$ of size $k - 1 - r$ and $\Omega$ of size $t + 2$ be pairwise disjoint
subsequences of $S$. If $1 \leq r \leq t + 2$ and $r \leq p - 1$, where $q = p^h$, then
\[
0 = \sum_{\substack{\Delta \subseteq \Omega \\ |\Delta| = r}} P_D(\Delta, L) \prod_{z \in (\Omega \setminus \Delta) \cup (L \setminus \{\ell_0\})} \det(z, \Delta, D)^{-1},
\]
where $\ell_0$ is the first element of $L$.

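Proof. By induction on $r$. The case $r = 1$ follows by dividing the equation in Lemma 2.2
with $E = \Omega$ and $Y = D$, by $T_D(\ell_0)$.

Fix $x \in L$ and apply the induction step to $L \setminus \{x\}$ and $\{x\} \cup D$,
\[
0 = \sum_{\substack{\Delta \subseteq \Omega \\ |\Delta| = r-1}} P_{D \cup \{x\}}(\Delta, L \setminus \{x\}) \prod_{z \in (\Omega \setminus \Delta) \cup (L \setminus \{\ell_0, x\})} \det(z, \Delta, x, D)^{-1}.
\]

Let $\Delta$ be a subset of $\Omega$ of size $r - 1$. Applying Lemma 2.2 with $E = (\Omega \cup L) \setminus (\Delta \cup \{\ell_0\})$
and $Y = \Delta \cup D$, we get
\[
0 = \sum_{x \in L \setminus \{\ell_0\}} T_{D \cup \Delta}(x) \prod_{z \in (\Omega \setminus \Delta) \cup (L \setminus \{\ell_0, x\})} \det(z, \Delta, x, D)^{-1}
+ \sum_{y \in \Omega \setminus \Delta} T_{D \cup \Delta}(y) \prod_{z \in (\Omega \setminus (\Delta \cup \{y\})) \cup (L \setminus \{\ell_0\})} \det(z, \Delta, y, D)^{-1}.
\]

Multiply this equation by $P_D(\Delta \cup \{d\}, L)\, T_{D \cup \Delta}(d)^{-1}$ for some $d$ for which $T_{D \cup \Delta}(d) \neq 0$. By
Lemma 2.4 we can rearrange $L$ so that the last element is $x$, which changes the sign
by $\sigma(x, L)$. This gives
\[
0 = \sum_{x \in L \setminus \{\ell_0\}} (-1)^{\sigma(x, L)} P_{D \cup \{x\}}(\Delta, L \setminus \{x\}) \prod_{z \in (\Omega \setminus \Delta) \cup (L \setminus \{\ell_0, x\})} \det(z, \Delta, x, D)^{-1}
+ \sum_{y \in \Omega \setminus \Delta} P_D(\Delta \cup \{y\}, L) \prod_{z \in (\Omega \setminus (\Delta \cup \{y\})) \cup (L \setminus \{\ell_0\})} \det(z, \Delta, y, D)^{-1},
\]
since
\[ P_D(\Delta \cup \{d\}, L)\, T_{D \cup \Delta}(x)\, T_{D \cup \Delta}(d)^{-1} = P_{D \cup \{x\}}(\Delta, L \setminus \{x\}) \]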

and by Lemma [231 (and Lemma [231) 



\[ P_D(\Delta \cup \{d\}, L)\, T_{D \cup \Delta}(y)\, T_{D \cup \Delta}(d)^{-1} = P_D(\Delta \cup \{y\}, L). \]

Note that in the second term we can order $\Delta \cup \{y\}$ in any way we please without
changing the sign since, by Lemma 2.3, interchanging two elements of $\Delta \cup \{y\}$ in
$P_D(\Delta \cup \{y\}, L)$ changes the sign by $(-1)^{t+1}$; exactly the same change occurs when we
interchange the same vectors in the product of determinants.

Therefore, when we sum this equation over subsets $\Delta$ of $\Omega$ of size $r - 1$ and apply the
induction hypothesis, the first sum is zero and the second sum gives